AITopics | sampling technique

Collaborating Authors

sampling technique

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Sampling Techniques for Kernel Methods

Neural Information Processing SystemsApr-6-2023, 16:37:38 GMT

We propose randomized techniques for speeding up Kernel Principal Component Analysis on three levels: sampling and quantization of the Gram matrix in training, randomized rounding in evaluating the kernel expansions, and random projections in evaluating the kernel itself. In all three cases, we give sharp bounds on the accuracy of the obtained ap- proximations. Rather intriguingly, all three techniques can be viewed as instantiations of the following idea: replace the kernel function by a "randomized kernel" which behaves like

kernel method, sampling technique

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Kernel Methods (0.80)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.34)

Add feedback

A Complete Guide on Sampling Techniques for Data Science

#artificialintelligenceSep-26-2021, 17:26:11 GMT

In this guide, I will share a detailed deep-dive of what is sampling, what are sampling techniques, and the industry use cases. As you know, fundamental to Data Science is getting good quality sample data. We always derive population parameters from the sample. Our machine learning models will not yield the desired results, if the sample data we worked on does not closely represent the population. In sampling, we select a group of individuals from a target population. This group of individuals forms a sample.

representation, sampling technique, target population, (15 more...)

#artificialintelligence

Country: Asia > India (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.88)
Information Technology > Data Science (0.86)

Add feedback

An Empirical Study on Predictability of Software Code Smell Using Deep Learning Models

Gupta, Himanshu, Kulkarni, Tanmay G., Kumar, Lov, Neti, Lalita Bhanu Murthy, Krishna, Aneesh

arXiv.org Artificial IntelligenceAug-8-2021

Code Smell, similar to a bad smell, is a surface indication of something tainted but in terms of software writing practices. This metric is an indication of a deeper problem lies within the code and is associated with an issue which is prominent to experienced software developers with acceptable coding practices. Recent studies have often observed that codes having code smells are often prone to a higher probability of change in the software development cycle. In this paper, we developed code smell prediction models with the help of features extracted from source code to predict eight types of code smell. Our work also presents the application of data sampling techniques to handle class imbalance problem and feature selection techniques to find relevant feature sets. Previous studies had made use of techniques such as Naive - Bayes and Random forest but had not explored deep learning methods to predict code smell. A total of 576 distinct Deep Learning models were trained using the features and datasets mentioned above. The study concluded that the deep learning models which used data from Synthetic Minority Oversampling Technique gave better results in terms of accuracy, AUC with the accuracy of some models improving from 88.47 to 96.84.

dataset, hypothesis, significant improvement, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/978-3-030-75075-6_10

2108.04659

Country:

Oceania > Australia (0.04)
Asia > Middle East > Oman (0.04)
Asia > India (0.04)

Genre:

Research Report > Experimental Study (0.96)
Research Report > New Finding (0.90)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Data Scientist's Guide to 8 Types of Sampling Techniques

#artificialintelligenceSep-11-2019, 05:53:35 GMT

Here's a scenario I'm sure you are familiar with. You download a relatively big dataset and are excited to get started with analyzing it and building your machine learning model. And snap – your machine gives an "out of memory" error while trying to load the dataset. It's happened to the best of us. It's one of the biggest hurdles we face in data science – dealing with massive amounts of data on computationally limited machines (not all of us have Google's resource power!).

artificial intelligence, machine learning, subgroup, (17 more...)

#artificialintelligence

Technology:

Information Technology > Data Science (0.72)
Information Technology > Artificial Intelligence > Machine Learning (0.54)

Add feedback

Learning Simulation Control in General Game-Playing Agents

Finnsson, Hilmar (Reykjavik University) | Björnsson, Yngvi (Reykjavik University)

AAAI ConferencesJul-15-2010

The aim of General Game Playing (GGP) is to create intelligent agents that can automatically learn how to play many different games at an expert level without any human intervention. One of the main challenges such agents face is to automatically learn knowledge-based heuristics in real-time, whether for evaluating game positions or for search guidance. In recent years, GGP agents that use Monte-Carlo simulations to reason about their actions have become increasingly more popular. For competitive play such an approach requires an effective search-control mechanism for guiding the simulation playouts. In here we introduce several schemes for automatically learning search guidance based on both statistical and reinforcement learning techniques. We compare the different schemes empirically on a variety of games and show that they improve significantly upon the current state-of-the-art in simulation-control in GGP. For example, in the chess-like game Skirmish, which has proved a particularly challenging game for simulation-based GGP agents, an agent employing one of the proposed schemes achieves 97% winning rate against an unmodified agent.

agent, mast, simulation, (16 more...)

AAAI Conferences

Twenty-Fourth AAAI Conference on Artificial Intelligence

Country:

North America > Canada > Alberta (0.14)
Europe > Iceland > Capital Region > Reykjavik (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > United States > California > Los Angeles County > Los Angeles (0.04)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)

Add feedback

Sampling Techniques for Kernel Methods

Achlioptas, Dimitris, Mcsherry, Frank, Schölkopf, Bernhard

Neural Information Processing SystemsDec-31-2002

We propose randomized techniques for speeding up Kernel Principal Component Analysis on three levels: sampling and quantization of the Gram matrix in training, randomized rounding in evaluating the kernel expansions, and random projections in evaluating the kernel itself. In all three cases, we give sharp bounds on the accuracy of the obtained approximations. Rather intriguingly, all three techniques can be viewed as instantiations of the following idea: replace the kernel function by a "randomized kernel" which behaves like in expectation.

artificial intelligence, machine learning, matrix, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Connecticut > New Haven County > New Haven (0.04)
North America > United States > California > Santa Barbara County > Santa Barbara (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Sampling Techniques for Kernel Methods

Achlioptas, Dimitris, Mcsherry, Frank, Schölkopf, Bernhard

Neural Information Processing SystemsDec-31-2002

We propose randomized techniques for speeding up Kernel Principal Component Analysis on three levels: sampling and quantization of the Gram matrix in training, randomized rounding in evaluating the kernel expansions, and random projections in evaluating the kernel itself. In all three cases, we give sharp bounds on the accuracy of the obtained approximations. Rather intriguingly, all three techniques can be viewed as instantiations of the following idea: replace the kernel function by a "randomized kernel" which behaves like in expectation.

evaluation, kernel, matrix, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Connecticut > New Haven County > New Haven (0.04)
North America > United States > California > Santa Barbara County > Santa Barbara (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Sampling Techniques for Kernel Methods

Achlioptas, Dimitris, Mcsherry, Frank, Schölkopf, Bernhard

Neural Information Processing SystemsDec-31-2002

We propose randomized techniques for speeding up Kernel Principal Component Analysis on three levels: sampling and quantization of the Gram matrix in training, randomized rounding in evaluating the kernel expansions, and random projections in evaluating the kernel itself. In all three cases, we give sharp bounds on the accuracy of the obtained approximations. Ratherintriguingly, all three techniques can be viewed as instantiations of the following idea: replace the kernel function by a "randomized kernel" which behaves like in expectation.

artificial intelligence, machine learning, matrix, (17 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback